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Abstract 

This paper provides general matrix formulas for computing the score function, the (expected and observed) 
Fisher information and the A matrices (required for the assessment of local influence) for a quite general 
model which includes the one proposed by Russo et al. (2009). Additionally, we also present an expression 
for the generalized leverage. The matrix formulation has a considerable advantage, since although the 
complexity of the postulated model, all general formulas are compact, clear and have nice forms. 
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1. Main results 

Recently, Russo et al. (2009) introduce an interesting nonlinear mixed model considering an elliptical 
distribution for the response variable. The authors also present a motivating example in a kinetics longi- 
tudinal data set which was firstly presented in Vonesh and Carter (1992) and previously analyzed under 
the assumption of normality. Russo et al. (2009) analyze this dataset considering heavy-tailed distributions 
which may accommodate "large" observations. The authors compute the score function. Fisher information 
and some influence measures, but some matrices are presented only with the (r, s) element. The first au- 
thor to compute expressions for the entries of the expected Fisher information in a multivariate elliptical 
distribution was, perhaps, Mitchell (1989). Other recent papers have adopted the same strategy, namely 
Savalli et al. (2006) and Osorio et al. (2007). Since writing a matrix by entering element by element is not 
an efficient way to do it, we present a matrix version of these quantities (considering a more general model) 
in which, besides an aesthetic improvement, one can use it for avoiding that cumbersome task. Moreover, 
the compactness of the expressions might encourage other researches to study more complex models. We 
also show matrix versions of some expectations of a variable with elliptic distribution that can be useful to 
apply in a multivariate context. 

The nonlinear model studied in Russo et al. (2009) is given by 

Vi^ f{x.i,a) + Zib^ + Ei, i=l,...,n (1) 

and, as defined by the authors, / is an mi-dimensional nonlinear function of a, Xi is a vector of covariates, 
Zi is a matrix of known constants, a is a p x 1 vector of unknown parameters and bi is an r x 1 vector of 
unobserved random regression coefficients, where {yi,bi) follows an elliptical distribution, such that 



Hi \ ind p. 
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where is an (jrii x rrii) identity matrix. 

For the purpose of avoiding numerical integrations, Russo et al. (2009) consider the marginal model, that 



is Ui Elrriiifixi, a); S^), where — ZiDZ^ + a'^Irm- The vector of parameters of interest is defined 
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as d = (a^, 7^)^, where 7 = (70, 71, . . . , 7^)^ is the vector of parameters involved in with, in this case, 
7o = cr^. In addition to the authors' suppositions, the functional form of f{xi, a.) must be known and twice 
continuously differentiable with respect to each clement of a. 
In this paper, we consider the following model, 

*~ El„,(/(a;i,a);Si('UJi,7)), (2) 

where Wi and Xi may have common components. The functional form of the covariance matrix Hi{wi,^) is 
known and twice continuously differentiable with respect to each element of 7. Since 6 must be identifiable 
in model (1), we suppose that the model fulfills this requirement. To see that model (1) is a special case of 
(2), take Wi = Zi and 'Si{Zi,j) = ZiDZj + a'^I„i.. As model (2) is not considering a specific structure for 
Si, it can represent other multivariate models. That is, model (1) can be generalized just by considering 
Ri{zi, (J2) instead of cr^Imi, where Zi is a vector of extra dispersion covariates. Then, in this context, we 
have that Y:i(wi,-f) = ZiDZj + Ri{zi, cr) and 7 = (r^, cr^)^, where Wi = (Zj , zj)^ , t is a gi x 1 vector 
of dispersion parameters involved in D and cr is a 52 x 1 vector of dispersion parameters associated with the 
model error term. We can go further and assign, for instance, a first-order autorcgrcssivc covariance matrix 
to the error terms, that is, l^i{wi,f) = ZiDZj + (t'^V{p), where Vrs{p) = p^^^''^^ /{I — p), then Wi = Zi, 
q2 = 2 and 7 = (r, a^, p)^ . In general, Tii{wi, 7) may be any structured covariance matrix with properties 
aforementioned. To keep the same notation, consider 7 = (70, . . . ,79)^, i.e., gi + 92 = 9 + 1, then, the 
number of parameters is still h = p + q + \ (here, b is fixed and h <^n). 

Russo et al. (2009) show that the score functions considering model (1) are given by 

n 1 ^ 

C/a =Y,^iJjT.T^ri and C/^, = -3 E {^-"[^i^^m] ' ^i^J^i^^i{o)^i^^i} for j = 0, . • . ,g 

i=l i=l 

where v., = -2Wg(?ij), u., = rJ^^^Ti, Ti = yi - f{xi,a.), Ji = df{xi, OL)/da^ , Si^-) = dUi/d^ij, Wg{ui) = 
dg{ui) / dui and function g{-) is the density generator function with properties defined in Russo et al. (2009). 
Notice that, the score function U^. has a typographical error. The right form is given by 

1 " 

^7.- = - 2 ^ {t^[^r'SiO)] - i;ir7sris,( .)SrVi} for j = 0, . . . , q. 

i=l 

The authors also show that the expected Fisher information considering model (1) is given by 
where 

and the (r, s) element of K.^^ is given by 

with Cj = 4:fgi/[mi{mi + 2)] and the quantities dgi, fgi and Ursi are well defined in Russo et al. (2009). Note 
that, the above score functions and Fisher information arc essentially the same of those under model (2), but 
here matrix Sj does not have the specific structure as regarded in Russo et al. (2009), it is left in the general 
form. See also that the score function and Fisher information for 7 are written in an element-by-element 
form. 

This paper is organized as follows. Section 1.1 presents a matrix version for the score function, the 
(observed and expected) Fisher information and shows an iterative re-weighted least squares algorithm to 
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attain the maximum-likelihood estimate for 9. Section 1.2 shows a matrix version for the A matrices 
presented by Russo et al. (2009) which are also appHcable for model (2). Additionally, Section 1.3 presents 
an expression for the generalized leverage in model (2). We do not present an application in this paper, 
since it can be seen just as a complementary material of Russo et al. (2009). 

1.1. Matrix version for the score function and Fisher information 

The following two matrix results will be intensively used in the computation of the expressions derived in 
this paper. Let A, B, C and Dhenxn matrices, define also A = (ai, a2, . . . , a„) and C = (ci, C2, . . . , c„), 
where Oj and c, are n x 1 vectors, then 

tr{A'^CDB'^} = yec{A)'^{B^C)vec{D) and A^BC={ajBcs} (3) 

where vec(-) is the vec operator, which transforms a matrix into a vector by stacking the columns of the 
matrix one underneath the other, "®" indicates the Kronecker product. These results and other methods 
in matrix differential calculus can be studied in Magnus and Neudecker (2007). 
Define the following quantities, 

= (o Vi) ' = ( iSri 0Sri) ' = {-yec{'E--VirirJ)) 

and Vi = (vec(Si(Q)), . . . , vec(Si(q))), where Fi has rank b (i.e., the functions / and must be defined to 
hold such condition). Then, by using (3) and after a somewhat algebra, we have that the score function and 
the expected Fisher information, considering model (2), can be written, respectively, as 

n n 

U0=Y,F^HiUi and Keg = Y,F^ HiOiH.F, (4) 

i=l i=l 

where 

Ui-ayra.-^ 2S,®sJ+^''* '\Q vcc(s,)vcc(s,)^y' • 

Fisher information given in (4) can clearly be interpreted as a quadratic form which can be easily 
attained through direct matrix operations. Thus, a joint iterative procedure for attaining the MLE of 6 can 
be formulated as the following re-weighted least squares algorithm 

^(m+l) ^ pbn)TJjirn)pira)\^ ~(™) ^ ^ ^ = 2, . . . (5) 

where the quantities with the upper script "(m)" are evaluated at 0*^"'^, = HiOiHi, Ui = H^^O~^Ui + 
Fi6 and m is the iteration counter. Under normality we have that Cj = 1, Oj = H^^ and Vi = 1, and it is 
easy to sec that this iterative procedure (under normality) is a special case of the one proposed in Patriota 
and Lemonte (2009). 

In the sequence, we provide a matrix formulation for the observed Fisher information which requires 

harder matrix operations than the one spent in the expected Fisher information. The observed Fisher 
information presented in Russo et al. (^2009), that is the same observed Fisher information considering 
model (2), is given by -Lee = - Z^"=i Lee,i, with 

" _ dLjjd) _ f Laoc,i La-y,i\ 

dOde^ \L^c,i L^-rJ 

where 

Lccci = SJ^Sri {Wg{ui)i:i + 2W;,{ui)rirJ} T.T^Ji - 2Wg{ui)[Ip ® rjT.f]Di, 
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form 



{Wg{u^)^E^+W|J{ui)r,rJ} S^^Sj^jS-V, and the element {j,k) of X-^^,,, has the 

(7) 



Si 



with 



-D, 



(11) 



(ip) 



and 



*i(r5) 



dardas 



Note that, quantities (6) and (7) are not written in a matrix form, in the following we present a compact 
matrix version of Lee- 

'dFi 



n |. 



ae 



(8) 



where 



Oi = 2Wgiui) 



Si 2Si 8) rj 

2Si (g) n 2(Si (rir7) + (virj) ® S^) 



+ 2 





Si (g) Si 



(nr7) 



(rir7) ® r-i vec(rir7)vec(rir7) 



is an mi{mi + l)xbxb array, [w^-ffj] [^-j is the bracket product of iij and (for further details 
see Wei, 1998, on pg. 188). 

In what follows, we present some matricial results on elliptical variables. Here, Elmj(0, Sj), then 

adapting the results of Mitchell (1989) for a matrix version, we have that 

a) E{riVi) = 0, 

b) E{r,rjvi) = S„ 

c) E{r,rjv1) = Adg./m^'Ei 

d) E{vec{rirJ)rJvf) = 



e) E{vec{rirJ)vec{rirJfvf) = a (^vec(Si)vec(Si)"r + 2Si S^^ 



Therefore, as we are considering a function (5f(-_) with regular properties (differentiation and integration are 
interchangeable), we have that E{ui) = 0, E{—Oi) = E{uiuJ) = Oi and, consequently, E{—Lgg) = K^g. 

1.2. Matrix version for A 

The diagnostic technique developed in Cook (1986) is a well-spread tool to check the model assumptions 
and conduct diagnostic studies. The author proposes to look at the likelihood displacement LD{(jj) = 
2{L(6) — L(0tj)} to find possible influential observations in the MLEs, where L{6) = 'Y^iLi{0) is the log- 
likelihood function and w is a s x 1 vector of perturbation restricted in an open set f2 C . It is also defined 
a vector of no perturbation as wq S n in which LD{u:o) = 0, i.e., L{0^g) = L{6). In his seminal paper. Cook 
shows that the normal curvature at the unit direction I has the following form Ce{9) = 2\£^ {Lge)~^ A£\ 
where A = d'^L{d\u3)/d6du}^ , both A and L00 are evaluated at 6 = 6 and U3 = u>o- Thus, Cdmax is twice 
the largest eigenvalue of B = —AJL^^A and dmax is the corresponding eigenvector. The index plot of 
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dmax may reveal how to perturb the model (or data) to obtain large changes in the estimate of 6. For a 
more detailed information, we refer the reader to the work of Russo et al. (2009) and the references therein. 

Note that, by using the defined quantities, we can write the 6 x n matrix A in the case weight perturbation 
(i.e., Li{9i^g) = ujiLi{d)) and the scale perturbation (i.e., the perturbed log-likelihood function Li{du,) is 
built replacing Sj with cJ^^'Si in Li{6)), respectively, by 



Hiiii,..., F„ HnUnj and A = \^F^ H^vi , . . . , Fj HnVn ) , (9) 

where the quantities with are evaluated at and 

= -2{W,{ui)+u,W'^{u,)) (^e,(^;^T)y • 

In Russo et al. (2009), the A matrix under a case weight perturbation is presented with the same typo 
of the score function. Finally, the b x N matrix A under the response perturbation (i.e., the perturbed 
log-likelihood function Li{dM) is built replacing yt with yi + in Li(d)) becomes 



where N = rrn and 

\2ri ® {Wg{ui)Im, + W'g{ui)rirJllT^)) ' 
Note that, formulas (9) and (10) are easily handled through any statistical software. 



(10) 



1.3. Generalized leverage 

In this section, we compute the generalized leverage proposed by Wei et al. (1998) . Let y = vec(yi ,...,?/«) 
and /x(q;) = vcc(/(q;, cci), . . . , /(a, a;„)). The authors have shown that the generalized leverage is obtained 
by evaluating the N x N matrix 

GL{e) = Dei-Lee)-' LoY, 

a.t 9 = 0, where De = diJ,{a.)/d9'^ and L^y = d'^(.{9)/d9dY'^ . The main idea behind the concept of 

leverage is that of evaluating the influence of Yi on its own predicted value. As noted by the authors, the 
generalized leverage is invariant under reparameterizations and observations with large GLa are leverage 
points. 

Under the model defined in (2), we have that 



and LeY = (f^ H^Gt, FjH^Gn) 



Index plots of GLa may reveal those observations with high influence on their own predicted values. 

It is worth emphasize that other models are special cases of the formulas derived in this paper. One just 
has to define f{xi,cy.) and 11i{wi,^) and flnd their derivatives. That is, the score vector and the (expected 
and observed) Fisher information as well as the curvatures and the generalized leverage (when available) of 
several works are special cases of the proposed matrix formulation (to mention just a few of them, see for 
instance, Paula et al., 2003; Savalh et al, 2006; Osorio et al., 2007; Paula et al., 2009; Russo et al., 2009). 



/Ji 0\ 
J2 



\Jn 0/ 
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2. Conclusion 



In this short communication, we presented a matrix formulation of the score function, the (expected 
and observed) Fisher information, the generahzed leverage and the A matrices under case weight, scale and 
response perturbations for a very general elliptical model which includes the nonlinear mixed-effects elliptical 
model proposed in Russo et al. (2009). The general expressions derived in this paper can be applied in many 
other models and have advantages for numerical purposes because they require only simple operations on 
matrices and vectors. 
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